SEQUENCE LISTING 



GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: Gene Shears Pty. Limited 

(B) STREET: Suite 1, Building 5, 105 Delhi Road 

(C) CITY: North Ryde 

(D) STATE: North Ryde 

(E) COUNTRY: Australia 

(F) POSTAL CODE (ZIP) : NSW 2113 

(A) NAME: PAUL, Wyatt 

(B) STREET: c/o Nickerson Biocem Ltd, Cambridge Science Park, Milton Rd

(C) CITY: Cambridge 

(D) STATE: Cambridge 

(E) COUNTRY: UK 

(F) POSTAL CODE (ZIP) : CB4 5GZ 

(A) NAME: PEREZ, Pascual

(B) STREET: c/o Biogemma, Campus Universitaire des Cezeaux

(C) CITY: 24 Avenue des Landais 

(D) STATE: Aubiere 

(E) COUNTRY: France 

(F) POSTAL CODE (ZIP): 63170 

(A) NAME: HUTTNER, Eric 

(B) STREET: c/o Groupe Limagrain Pacific Pty Ltd, GPO Box 475

(C) CITY: Canberra 

(D) STATE: Canberra, ACT 

(E) COUNTRY: Australia 

(F) POSTAL CODE (ZIP): 2601

(A) NAME: BETZNER, Andreas Stefan 

(B) STREET: Groupe Limagrain Pacific Pty Ltd, GPO Box 475 

(C) CITY: Canberra

(D) STATE: Canberra, ACT

(E) COUNTRY: Australia 

(F) POSTAL CODE (ZIP) : 2601 

(ii) TITLE OF INVENTION: Protein Complementation
(iii) NUMBER OF SEQUENCES: 68 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30 (EPO)

(v) CURRENT APPLICATION DATA: 

APPLICATION NUMBER: WO PCT/GB98/00542 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 344 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 9..344

(D) OTHER INFORMATION: /product= "Barnase"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

TCTAGACC ATG GCA CAG GTT ATC AAC ACG TTT GAC GGG GTT GCG GAT TAT
Met Ala Gln Val Ile Asn Thr Phe Asp Gly Val Ala Asp Tyr
1 5 10

CTT CAG ACA TAT CAT AAG CTA CCT GAT AAT TAC ATT ACA AAA TCA GAA
Leu Gln Thr Tyr His Lys Leu Pro Asp Asn Tyr Ile Thr Lys Ser Glu
15 20 25 30

GCA CAA GCC CTC GGC TGG GTG GCA TCA AAA GGG AAC CTT GCA GAC GTC
Ala Gln Ala Leu Gly Trp Val Ala Ser Lys Gly Asn Leu Ala Asp Val
35 40 45

GCT CCG GGG AAA AGC ATC GGC GGA GAC ATC TTC TCA AAC AGG GAA GGC
Ala Pro Gly Lys Ser Ile Gly Gly Asp Ile Phe Ser Asn Arg Glu Gly
50 55 60

AAA CTC CCG GGC AAA AGC GGA CGA ACA TGG CGT GAA GCG GAT ATT AAC
Lys Leu Pro Gly Lys Ser Gly Arg Thr Trp Arg Glu Ala Asp Ile Asn
65 70 75

TAT ACA TCA GGC TTC AGA AAT TCA GAC CGG ATT CTT TAC TCA AGC GAC
Tyr Thr Ser Gly Phe Arg Asn Ser Asp Arg Ile Leu Tyr Ser Ser Asp
80 85 90

TGG CTG ATT TAC AAA ACA ACG GAC CAT TAT CAG ACC TTT ACA AAA ATC
Trp Leu Ile Tyr Lys Thr Thr Asp His Tyr Gln Thr Phe Thr Lys Ile
95 100 105 110

AGA TAA
Arg
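
A CDS feature such as the one above pairs a nucleotide range with its translation, so the two can be cross-checked mechanically. The following is a minimal sketch, assuming the standard genetic code applies; the function name `translate_cds` is hypothetical and is not part of the filed listing.

```python
from itertools import product

# Standard genetic code in TCAG order (assumption: no alternative table applies).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate_cds(seq, start, end):
    """Translate seq[start..end] (1-based, inclusive) into one-letter residues.

    Whitespace in the input is ignored; translation stops at the first stop codon.
    """
    cds = "".join(seq.split()).upper()[start - 1:end]
    protein = []
    for i in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE[cds[i:i + 3]]
        if residue == "*":  # stop codon ends the reading frame
            break
        protein.append(residue)
    return "".join(protein)
```

For instance, `translate_cds("TCTAGACC ATG GCA CAG GTT ATC AAC ACG", 9, 29)` returns `"MAQVINT"`, which corresponds to the first seven residues (Met Ala Gln Val Ile Asn Thr) listed for SEQ ID NO: 1.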



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 112 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



Met Ala Gln Val Ile Asn Thr Phe Asp Gly Val Ala Asp Tyr Leu Gln
1 5 10 15

Thr Tyr His Lys Leu Pro Asp Asn Tyr Ile Thr Lys Ser Glu Ala Gln
20 25 30

Ala Leu Gly Trp Val Ala Ser Lys Gly Asn Leu Ala Asp Val Ala Pro
35 40 45

Gly Lys Ser Ile Gly Gly Asp Ile Phe Ser Asn Arg Glu Gly Lys Leu
50 55 60

Pro Gly Lys Ser Gly Arg Thr Trp Arg Glu Ala Asp Ile Asn Tyr Thr
65 70 75 80

Ser Gly Phe Arg Asn Ser Asp Arg Ile Leu Tyr Ser Ser Asp Trp Leu
85 90 95

Ile Tyr Lys Thr Thr Asp His Tyr Gln Thr Phe Thr Lys Ile Arg
100 105 110
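
Protein entries in this listing use three-letter residue codes. When comparing them with one-letter output from other tools, a small conversion helper can be useful. This is an illustrative sketch only; the name `to_one_letter` is hypothetical, and mapping `Xaa` to `X` is an assumption made for convenience.

```python
# Three-letter to one-letter residue codes for the 20 standard amino acids.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
    "Xaa": "X",  # assumption: unknown or special positions map to X
}


def to_one_letter(residues):
    """Convert a space-separated run of three-letter codes to a one-letter string."""
    return "".join(THREE_TO_ONE[code] for code in residues.split())
```

A misread code (for example a residue name damaged during text extraction) raises `KeyError`, which makes such damage easy to spot.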

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA



(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..18

(D) OTHER INFORMATION: /note= "Figure 1A: B1 primer"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
CATGGTCTAG AGTACTTG
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..16

(D) OTHER INFORMATION: /note= "Figure 1A: B4 primer"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
CCAGCCGAGG GCTTGT 
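
Each entry declares a LENGTH that can be compared against the sequence line itself. The sketch below shows one way to do that, assuming sequence lines contain only residue letters, spacing, and numeric position counters; the helper names are hypothetical.

```python
import re


def cleaned_length(sequence_line):
    """Count residue letters, ignoring spaces and numeric position counters."""
    return len(re.sub(r"[^A-Za-z]", "", sequence_line))


def matches_declared(sequence_line, declared_length):
    """Return True when the cleaned sequence has the declared LENGTH."""
    return cleaned_length(sequence_line) == declared_length
```

For SEQ ID NO: 4, `matches_declared("CCAGCCGAGG GCTTGT", 16)` evaluates to `True`.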

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..16

(D) OTHER INFORMATION: /note= "Figure 1A: B2 primer"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
GCATCAAAAG GGAACC

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 228 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..228

(D) OTHER INFORMATION: /note= "Figure 1B: Intergenic Sequence"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

CGAAAAAAAC GGCTTCCTGC GGAGGCCGTT TTTTTCAGCT TTACATAAAG TGTGTAATAA 60

ATTTTTCTTC AAACTCTGAT CGGTCAATTT CACTTTCCGG ATCCGGTCCA ATCTGCAGCC 120

GTCCGAGACA GGAGGACATC GTCCAGCTGA AACCGGGGCA GAATCCGGCC ATTTCTGAAG 180

AGAAAAATGG TAAACTGATA GAATAAAATC ATAAGAAAGG AGCCGCAC 228
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 323 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..273

(D) OTHER INFORMATION: /product= "Barstar"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

ATG AAA AAA GCA GTC ATT AAC GGG GAA CAA ATC AGA AGT ATC AGC GAC 48
Met Lys Lys Ala Val Ile Asn Gly Glu Gln Ile Arg Ser Ile Ser Asp
115 120 125

CTC CAC CAG ACA TTG AAA AAG GAG CTT GCC CTT CCG GAA TAC TAC GGT 96
Leu His Gln Thr Leu Lys Lys Glu Leu Ala Leu Pro Glu Tyr Tyr Gly
130 135 140

GAA AAC CTG GAC GCT TTA TGG GAT TGT CTG ACC GGA TGG GTG GAG TAC 144
Glu Asn Leu Asp Ala Leu Trp Asp Cys Leu Thr Gly Trp Val Glu Tyr
145 150 155 160

CCG CTC GTT TTG GAA TGG AGG CAG TTT GAA CAA AGC AAG CAG CTG ACT 192
Pro Leu Val Leu Glu Trp Arg Gln Phe Glu Gln Ser Lys Gln Leu Thr
165 170 175

GAA AAT GGC GCC GAG AGT GTG CTT CAG GTT TTC CGT GAA GCG AAA GCG 240
Glu Asn Gly Ala Glu Ser Val Leu Gln Val Phe Arg Glu Ala Lys Ala
180 185 190

GAA GGC TGC GAC ATC ACC ATC ATA CTT TCT TAA TACGATCAAT GGGAGATGAA 293
Glu Gly Cys Asp Ile Thr Ile Ile Leu Ser
195 200

CAATATAGAT CCCCCGGGCT GCAGGAATTC 323



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 91 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Lys Lys Ala Val Ile Asn Gly Glu Gln Ile Arg Ser Ile Ser Asp
1 5 10 15

Leu His Gln Thr Leu Lys Lys Glu Leu Ala Leu Pro Glu Tyr Tyr Gly
20 25 30

Glu Asn Leu Asp Ala Leu Trp Asp Cys Leu Thr Gly Trp Val Glu Tyr
35 40 45

Pro Leu Val Leu Glu Trp Arg Gln Phe Glu Gln Ser Lys Gln Leu Thr
50 55 60

Glu Asn Gly Ala Glu Ser Val Leu Gln Val Phe Arg Glu Ala Lys Ala
65 70 75 80

Glu Gly Cys Asp Ile Thr Ile Ile Leu Ser
85 90

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..21

(D) OTHER INFORMATION: /note= "Figure 1C: B3 primer"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
TAATACGATC AATGGGAGAT G
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 194 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 9..194

(D) OTHER INFORMATION: /note= "Figure 1D"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

TCTAGACC ATG GCA CAG GTT ATC AAC ACG TTT GAC GGG GTT GCG GAT TAT 50
Met Ala Gln Val Ile Asn Thr Phe Asp Gly Val Ala Asp Tyr
95 100 105

CTT CAG ACA TAT CAT AAG CTA CCT GAT AAT TAC ATT ACA AAA TCA GAA 98
Leu Gln Thr Tyr His Lys Leu Pro Asp Asn Tyr Ile Thr Lys Ser Glu
110 115 120

GCA CAA GCC CTC GGC TGG ATG GGC GGT GGC GGT TCC GGT GGC GGT GGC 146
Ala Gln Ala Leu Gly Trp Met Gly Gly Gly Gly Ser Gly Gly Gly Gly
125 130 135

AGC GGC GGC GGT GGT AGC GGG ATC CCC GGG TAC GGT CAG TCC CTT ATG 194
Ser Gly Gly Gly Gly Ser Gly Ile Pro Gly Tyr Gly Gln Ser Leu Met
140 145 150



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 11: 

Met Ala Gln Val Ile Asn Thr Phe Asp Gly Val Ala Asp Tyr Leu Gln
1 5 10 15

Thr Tyr His Lys Leu Pro Asp Asn Tyr Ile Thr Lys Ser Glu Ala Gln
20 25 30

Ala Leu Gly Trp Met Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly
35 40 45

Gly Gly Gly Ser Gly Ile Pro Gly Tyr Gly Gln Ser Leu Met
50 55 60

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 526 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..526

(D) OTHER INFORMATION: /note= "Figure 1E"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



TCTAGACCAT GCAGATCTTC GTGAAAACCT TGACCGGCAA GACCATCACT CTCGAGGTCG 60

AGAGCAGCGA CACCATCGAC AATGTCAAGG CCAAGATCCA AGACAAAGAA GGTATCATTC 120

TTCCTCACTC AATCTGGATT CTTCTCTTTA GCTTTTTGAA ATTCAGATCT CTTATCATTT 180

ACTTGTTTCT CCTTTAAGGA ATCCCTCCGG ATCAGCAGAG ATTGATCTTC GCCGGAAAGC 240

AGCTCGAAGA TGGCCGTACT TTGGCTGACT ACAACATCCA GAAAGGTACG AAATCATCCG 300

AATCCTTCTG TTGATCATTT CGATGATCTG ATTGTATAAA CTCTAATGGA TTGTTATCAT 360

TTGTAAACAG AATCTACACT TCATCTTGTG TTGAGGCTTA GAGGTGGAGC ACAGGTTATC 420

AACACGTTTG ACGGGGTTGC GGATTATCTT CAGACATATC ATAAGCTACC TGATAATTAC 480

ATTACAAAAT CAGAAGCACA AGCCCTCGGC TGGATGTAGA GGATCC 526
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 631 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..631

(D) OTHER INFORMATION: /note= "Figure 1F"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
TCTAGACCAT GCAGATCTTC GTGAAAACCT TGACCGGCAA GACCATCACT CTCGAGGTCG 60

AGAGCAGCGA CCATCGACAA TGTCAAGGCC AAGATCCAAG ACAAAGAAGG TATCATTCTT 120

CCTCACTCAA TCTGGATTCT TCTCTTTAGC TTTTTGAAAT TCAGATCTCT TATCATTTAC 180

TTGTTTCTCC TTTAAGGAAT CCCTCCGGAT CAGCAGAGAT TGATCTTCGC CGGAAAGCAG 240

CTCGAAGATG GCCGTACTTT GGCTGACTAC AACATCCAGA AAGGTACGAA ATCATCCGAA 300

TCCTTCTGTT GATCATTTCG ATGATCTGAT TGTATAAACT CTAATGGATT GTTATCATTT 360

GTAAACAGAA TCTACACTTC ATCTTGTGTT GAGGCTTAGA GGTGGAGCAT CAAAAGGGAA 420

CCTTGCAGAC GTCGCTCCGG GGAAAAGCAT CGGCGGAGAC ATCTTCTCAA ACAGGGAAGG 480

CAAACTCCCG GGCAAAAGCG GACGAACATG GCGTGAAGCG GATATTAACT ATACATCAGG 540

CTTCAGAAAT TCAGACCGGA TTCTTTACTC AAGCGACTGG CTGATTTACA AAACAACGGA 600

CCATTATCAG ACCTTTACAA AAATCAGATA A 631

(2) INFORMATION FOR SEQ ID NO: 14:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..20

(D) OTHER INFORMATION: /note= "Figure 1G: B5"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:
CACAAGTACT CTAGACCATG
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..19

(D) OTHER INFORMATION: /note= "Figure 1G: B6"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
CATCCAGCCG AGGGCTTGT 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..16

(D) OTHER INFORMATION: /note= "Figure 1G: B7"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
GGCGGTGGCG GTTCCG 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..23

(D) OTHER INFORMATION: /note= "Figure 1G: B8"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
CCACTAGTTC TAGAGTACTT GTG 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..18

(D) OTHER INFORMATION: /note= "Figure 1G: B9"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:
GCACAGGTTA TCAACACG 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..31

(D) OTHER INFORMATION: /note= "Figure 1G: B10"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:
GCGGATCCTC TACATCCAGC CGAGGGCTTG T
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..16

(D) OTHER INFORMATION: /note= "Figure 1G: B11"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
GCATCAAAAG GGAACC 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..17

(D) OTHER INFORMATION: /note= "Figure 1G: B12"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
GGTCTAGAGT ACTTGTG 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..30

(D) OTHER INFORMATION: /note= "Figure 1G: Ubql6F"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:
GCTCTAGACC ATGCAGATCT TCGTGAAAAC
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 






(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..25

(D) OTHER INFORMATION: /note= "Figure 1G: UbqlR"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
CTGGATCCAC CTCTAAGCCT CAACA 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..24

(D) OTHER INFORMATION: /note= "Figure 1G: Ubqla"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

TATGGATCCC CCGGGCTGCA GGAA 

(2) INFORMATION FOR SEQ ID NO: 25:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..21

(D) OTHER INFORMATION: /note= "Figure 1G: Ubqlb"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
TCCACCTCTA AGCCTCAACA C 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..23

(D) OTHER INFORMATION: /note= "Fig 3A: lane 1, P



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
GCGGATCCAT GAAGGAGACC GCC
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 56 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..56

(D) OTHER INFORMATION: /note= "Fig 3A; lane 2, RNI"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27:
GCGGATCCAT GAAGGAGACC GCCGCCGCCA AGTTCGAGCG CCAGCACATG GACAGC
(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..22

(D) OTHER INFORMATION: /note= "Fig 3A: lane 3, RNI"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28:
CATAGATCTT TAGCTGTCCA TG
(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..28

(D) OTHER INFORMATION: /note= "Fig 3A: lane 4, P RN-d"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 



CCAGATCTAT GAGCTCCTCC AACTACTG 
(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..63

(D) OTHER INFORMATION: /note= "Fig 3A: lanes 2/5, RNII"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 



TAAAGATCTA TGAGCACCTC CGCCGCCAGC TCCTCCAACT ACTGCAACCA GATGATGAAG 60
TCT 63
(2) INFORMATION FOR SEQ ID NO: 31: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..21

(D) OTHER INFORMATION: /note= "Fig 3A: lane 6, RN2"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31:
TCAGGTTCCT AGACTTCATC A
(2) INFORMATION FOR SEQ ID NO: 32: 






(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..59

(D) OTHER INFORMATION: /note= "Fig 3A, lanes 5/7, RNIII"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
AGGAACCTGA CCAAGGACAG GTGCAAGCCA GTCAACACCT TCGTCCACGA GAGCCTGGC 59 
(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..19

(D) OTHER INFORMATION: /note= "Fig 3A: lane 8, RN3"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
CTGGACATCG GCCAGGCTC 
(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..48

(D) OTHER INFORMATION: /note= "Fig 3A, lanes 7/9, RN IV"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
CGATGTCCAG GCCGTCTGCA GCCAGAAGAA CGTGGCCTGC AAGAACGG



(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..21

(D) OTHER INFORMATION: /note= "Fig 3A: lane 10, RN 4"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
AGTTGGTCTG ACCGTTCTTG C 
(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..60

(D) OTHER INFORMATION: /note= "Fig 3A: lanes 9/11, RN V"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
TCAGACCAAC TGCTACCAGT CCTACAGCAC CATGTCCATC ACCGACTGCC GCGAGACCGG 



(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..19

(D) OTHER INFORMATION: /note= "Fig 3A: lane 12, RN5"
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 



CTTGCTGGAG CCGGTCTCG 



(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..55

(D) OTHER INFORMATION: /note= "Fig 3A: lanes 11/13, RN

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
CTCCAGCAAG TACCCTAACT GCGCCTACAA GACCACCCAG GCCAACAAGC ACATC 
(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..21

(D) OTHER INFORMATION: /note= "Fig 3A: lane 14, RN 6"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
CAGGCAACAA TGATGTGCTT G 
(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..24

(D) OTHER INFORMATION: /note= "Fig 3A: lane 15, Primer RN-b"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 
CGGGATCCTT TAGACGGAGG CGTC 
(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..66

(D) OTHER INFORMATION: /note= "Fig 3A: lanes 13/16, RN VII"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

ATTGTTGCCT GCGAGGGTAA CCCTTACGTG CCTGTCCACT TCGACGCCTC CGTCTAAAGG 
ATCCCG 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..23

(D) OTHER INFORMATION: /note= "Fig 3B: lane 1, PCR Primer RNa"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 
GCGGATCCAT GAAGGAGACC GCC 
(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..56

(D) OTHER INFORMATION: /note= "Fig 3B, lane 2, RN I"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43:
GCGGATCCAT GAAGGAGACC GCCGCCGCCA AGTTCGAGCG CCAGCACATG GACAGC 
(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 18 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..18

(D) OTHER INFORMATION: /note= "Fig 3B, lane 3, RN 7"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 
CCACCGCCGC TGTCCATG 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..57

(D) OTHER INFORMATION: /note= "Fig 3B: lanes 2/4, RN VII

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 
GGCGGTGGCG GTTCCGGTGG CGGTGGCAGC GGCGGCGGTG GTAGCAAGAT CTTCGGG 
(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 




(ii) MOLECULE TYPE: cDNA



(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..18

(D) OTHER INFORMATION: /note= "Fig 3B: lane 5, RN c"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 
CCCGAAGATC TTGCTACC 

(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..45

(D) OTHER INFORMATION: /note= "Fig 4, lane 1"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47:
AAAGAGACAG CAGCCGCAAA GTTTGAGCGT CAGCATATGG ATAGT
(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide

(B) LOCATION: 1..32

(D) OTHER INFORMATION: /note= "Fig 4A: lane 2"

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 17

(D) OTHER INFORMATION: /note= ""Xaa" corresponds to an ochre stop codon (UAA)"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

Met Lys Glu Thr Ala Ala Ala Lys Phe Glu Arg Gln His Met Asp Ser
1 5 10 15

Xaa Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser
20 25 30

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..63

(D) OTHER INFORMATION: /note= "Fig 4A: lane 3"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 
GGATCCATGA AGGAGACCGC CGCCGCCAAG TTCGAGCGCC AGCACATGGA CAGCTAAAGA 60

TCT 63 
(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 106 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..106

(D) OTHER INFORMATION: /note= "Fig 4A: lane 4"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
GGATCCATGA AGGAGACCGC CGCCGCCAAG TTCGAGCGCC AGCACATGGA CAGCGGCGGT 
GGCGGTTCCG GTGGCGGTGG CAGCGGCGGC GGTGGTAGCA AGATCT 
(2) INFORMATION FOR SEQ ID NO: 51: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 330 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 



(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..330

(D) OTHER INFORMATION: /note= "Fig 4B: lane 1"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

AGCACCAGTG CTGCCAGTTC TTCCAACTAC TGTAACCAGA TGATGAAGTC TAGAAACTTG 60

ACCAAGGACA GATGTAAGCC AGTTAACACA TTTGTCCACG AGAGTTTGGC TGATGTCCAA 120

GCCGTCTGCA GTCAGAAAAA CGTTGCATGC AAGAACGGTC AAACGAACTG TTACCAGAGT 180

TACAGCACCA TGTCCATCAC TGACTGTCGT GAGACAGGCT CGAGCAAGTA TCCTAATTGT 240

GCTTACAAGA CCACACAGGC GAACAAACAC ATCATTGTTG CTTGTGAAGG TAACCCTTAC 300

GTTCCTGTCC ACTTTGACGC CAGTGTTTAA 330
(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 132 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Peptide

(B) LOCATION: 1..132

(D) OTHER INFORMATION: /note= "Fig 4B: lane 2"
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

Met Ser Thr Ser Ala Ala Ser Ser Ser Asn Tyr Cys Asn Gln Met Met
1 5 10 15

Lys Ser Arg Asn Leu Thr Lys Asp Arg Cys Lys Pro Val Asn Thr Phe
20 25 30

Val His Glu Ser Leu Ala Asp Val Gln Ala Val Cys Ser Gln Lys Asn
35 40 45

Val Ala Cys Lys Asn Gly Gln Thr Asn Cys Tyr Gln Ser Tyr Ser Thr
50 55 60

Met Ser Ile Thr Asp Cys Arg Glu Thr Gly Ser Ser Lys Tyr Pro Asn
65 70 75 80

Cys Ala Tyr Lys Thr Thr Gln Ala Asn Thr Asp Cys Arg Glu Thr Gly
85 90 95

Ser Ser Lys Tyr Pro Asn Cys Ala Tyr Lys Thr Thr Gln Ala Asn Lys
100 105 110

His Ile Ile Val Ala Cys Glu Gly Asn Pro Tyr Val Pro Val His Phe
115 120 125

Asp Ala Ser Val
130

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 346 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..330

(D) OTHER INFORMATION: /note= "Fig 4B, lane 3"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 
AGATCTATGA GCACCTCCGC CGCCAGCTCC TCCAACTACT GCAACCAGAT GATGAAGTCT 60

AGGAACCTGA CCAAGGACAG GTGCAAGCCA GTCAACACCT TCGTCCACGA GAGCCTGGCC 120

GATGTCCAGG CCGTCTGCAG CCAGAAGAAC GTGGCCTGCA AGAACGGTCA GACCAACTGC 180

TACCAGTCCT ACAGCACCAT GTCCATCACC GACTGCCGCG AGACCGGCTC CAGCAAGTAC 240

CCTAACTGCG CCTACAAGAC CACCCAGGCC AACAAGCACA TCATTGTTGC CTGCGAGGGT 300

AACCCTTACG TGCCTGTCCA CTTCGACGCC TCCGTCTAAA GGATCC 346

(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 331 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..331

(D) OTHER INFORMATION: /note= "Fig 4B, lane 4"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 
AGATCTATGA GCTCCTCCAA CTACTGCAAC CAGATGATGA AGTCTAGGAA CCTGACCAAG 60

GACAGGTGCA AGCCAGTCAA CACCTCCGTC CACGAGAGCC TGGCCGATGT CCAGGCCGTC 120

TGCAGCCAGA AGAACGTGGC CTGCAAGAAC GGTCAGACCA ACTGCTACCA GTCCTACAGC 180

ACCATGTCCA TCACCGACTG CCGCGAGACC GGCTCCAGCA AGTACCCTAA CTGCGCCTAC 240

AAGACCACAC AGGCCAACAA GCACATCATT GTTGCCTGCG AGGGTAACCC TTACGTGCCT 300

GTCCACTTCG ACGCCTCCGT CTAAAGGATC C 331
(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 163 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..163

(D) OTHER INFORMATION: /note= "Fig 4C i"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 
TCTAGATCTT AACATGAAGA ATGTTTTAGT AAGGTCAGCT GCGCGAGCTC TGCTTGGCGG 60

CGGTGGGCGG AGCTACTACC GCCAGCTCTC AACGGCGGCG ATCGTGGAAC AGAGACACCA 120

GCACGGTGGC GGCGCGTTTG GAAGCTTCCA CTTAAGCGGA TCC 163
(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 198 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..198

(D) OTHER INFORMATION: /note= "Fig 4C ii"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

ATGAAGAATG TTTTAGTAAG GTCAGCTGCG CGAGCTCTGC TTGGCGGCGG TGGGCGGAGC 60

TACTACCGCC AGCTCTCAAC GGCGGCGATC GTGGAACAGA GACACCAGCA CGGTGGCGGC 120

GCGTTTGGAA GCTTCCACTT AAGAAGGATG AAGGAGACCG CCGCCGCCAA GTTCGAGCGC 180

CAGCACATGG ACAGCTAA 198
(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 270 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..270

(D) OTHER INFORMATION: /note= "Fig 4C iii"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

ATGAAGAATG TTTTAGTAAG GTCAGCTGCG CGAGCTCTGC TTGGCGGCGG TGGGCGGAGC 60

TACTACCGCC AGCTCTCAAC GGCGGCGATC GTGGAACAGA GACACCAGCA CGGTGGCGGC 120

GCGTTTGGAA GCTTCCACTT AAGAAGGATG AAGGAGACCG CCGCCGCCAA GTTCGAGCGC 180

CAGCACATGG ACAGCGGCGG TGGCGGTTCC GGTGGCGGTG GCAGCGGCGG CGGTGGTAGC 240

GGGATCCCCG GGTACGGTCA GTCCCTTATG 270
(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 465 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..465

(D) OTHER INFORMATION: /note= "Fig 4C iv"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

ATGAAGAATG TTTTAGTAAG GTCAGCTGCG CGAGCTCTGC TTGGCGGCGG TGGGCGGAGC 60

TACTACCGCC AGCTCTCAAC GGCGGCGATC GTGGAACAGA GACACCAGCA CGGTGGCGGC 120

GCGTTTGGAA GCTTCCACTT AAGAAGGATG AGCTCCTCCA ACTACTGCAA CCAGATGATG 180

AAGTCTAGGA ACCTGACCAA GGACAGGTGC AAGCCAGTCA ACACCTCCGT CCACGAGAGC 240

CTGGCCGATG TCCAGGCCGT CTGCAGCCAG AAGAACGTGG CCTGCAAGAA CGGTCAGACC 300

AACTGCTACC AGTCCTACAG CACCATGTCC ATCACCGACT GCCGCGAGAC CGGCTCCAGC 360

AAGTACCCTA ACTGCGCCTA CAAGACCACA CAGGCCAACA AGCACATCAT TGTTGCCTGC 420

GAGGGTAACC CTTACGTGCC TGTCCACTTC GACGCCTCCG TCTAA 465
(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 715 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..715

(D) OTHER INFORMATION: /note= "Fig 4C v" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 



ATGCAGATCT TCGTGAAAAC CTTGACCGGC AAGACCATCA CTCTCGAGGT CGAGAGCAGC 60

GACACCATCG ACAATGTCAA GGCCAAGATC CAAGACAAAG AAGGTATCAT TCTTCCTCAC 120

TCAATCTGGA TTCTTCTCTT TAGCTTTTTG AAATTCAGAT CTCTTATCAT TTACTTGTTT 180

CTCCTTTAAG GAATCCCTCC GGATCAGCAG AGATTGATCT TCGCCGGAAA GCAGCTCGAA 240

GATGGCCGTA CTTTGGCTGA CTACAACATC CAGAAAGGTA CGAAATCATC CGAATCCTTC 300

TGTTGATCAT TTCGATGATC TGATTGTATA AACTCTAATG GATTGTTATC ATTTGTAAAC 360

AGAATCTACA CTTCATCTTG TGTTGAGGCT TAGAGGTGGA TCCAGCTCCA ACTACTGCAA 420

CCAGATGATG AAGTCTAGGA ACCTGACCAA GGACAGGTGC AAGCCAGTCA ACACCTCCGT 480

CCACGAGAGC CTGGCCGATG TCCAGGCCGT CTGCAGCCAG AAGAACGTGG CCTGCAAGAA 540

CGGTCAGACC AACTGCTACC AGTCCTACAG CACCATGTCC ATCACCGACT GCCGCGAGAC 600

CGGCTCCAGC AAGTACCCTA ACTGCGCCTA CAAGACCACA CAGGCCAACA AGCACATCAT 660

TGTTGCCTGC GAGGGTAACC CTTACGTGCC TGTCCACTTC GACGCCTCCG TCTAA 715



(2) INFORMATION FOR SEQ ID NO: 60: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 
GGTGGATCCA GCTCCAACTA CTGCAAC 
(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 
CGGGATCCTT AGACGGAGGC GTCG 
(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 
GTCCTTAAGA AGGATGAGCT CCTCCAACTA C
(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 
CGGGATCCTT AGACGGAGGC GTCG 
(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64:
GTCCTTAAGA AGGATGAAGG AGACCGCCG 
(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65:
TCGGGATCCT TAGCTGTCCA TGTGCTG 
(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66:
TCGGGATCCT CATTGTTTGC CTCCCTG 
(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67:
TGCTCTAGAT CTTAACATGA AGAATGTTTT AG 



(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68:
TCGGATCCGC TTAAGTGGAA GCTTCCAAAC



